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Allele Calling for di-nucleotide marker in linkage mapping 

application Sample Data (3) 
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Figure 10 



Allele Calling for di-nucleotide marker in linkage mapping 
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Allele Calling for di-nucleotide marker in linkage mapping 

application Sample Data (1) 
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allele peaks (indicated by red star) 
when serious differential 
amplification occurs* 
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Figure 12 
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Figure 13: Standard heterozygous allele signature. Circles denote user annotated allele calls, 
axis is in base pairs, y-axis is in AID counts (voltage intensity) 
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Figure 14: Steps in the allele calling routine. First the signal is simplified via sampling and its 
peaks are located. This forms the target signal that is to be approximated. The two 
n interconnected boxes indicate the process of varying the parameters and testing how closely the 
K resulting signal matches the sampled version of the original. The set of parameters that yield the 
yS closest match contain the allele calls. 




Figure 15: Illustration of hypothesis formation in the optimizer routine. The two columns of 
Figures represent the optimal solution (left column) and a suboptimal solution (right column). 
Panel (a) shows the target vector with the two red lines showing the location of the candidate 
peaks. Panel (c) shows the hypothesis formed using different values of stutter and + A. Panel (c) 
shows the residual error resulting from subtraction of the signal in panel (c) from the signal in 
panel (a) (sum squared error = 0.0355). Panels (b,d,f) show the same process for for a slightly 
different allele hypothesis. This is a poor hypothesis and the residual is rather signifcant ( SSE - 
0.4715). The x-axis is somewhat meaningless at this point since it gets mapped back to base-pair 
indices after the winning hypothesis is chosen. 
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Figure 16: Division of heterozygous signal into panels by the Envelope Call* ■ algonta u Tbe 
pS L ranked according to signal energy and the three 

U the two panels containing strong allele signages ^^^^^T 
annotated allele calls, x-axis is in base pairs, y-axis is in Ay© counts (voltage intensity) 
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Baselining a Gradual Slope 
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Bottom Baseline with Negative Structure 
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Figure 20: The bottom baseline is shown in blue. Note the two downward 
spikes, around data points 1700 and 1900 in the primer peak. 
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Figure 21 : Bottom baseline after eliminating the negative 
spikes. 
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Figure 22: Baselining by averaging the top and bottom. 
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Figure 23: The baselined signal. Note negative values. 
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Figure 24: A signal (blue), its first derivative (red), and 

the positive-to-negative zero crossing (magenta). 
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Figure 25: Detail of the peak location. 




Figure 26: Set symmetric parameters if the peak i 
symmetric. 




Figure 27: Do not set symmetric parameters if the peak is not symmetric. 




Figure 28: Set asymmetric parameters if the peak is not symmetric. 
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Figure 29: Calculating the first derivative aUT =30. 




Figure 30: Calculating the first derivative by fitting 
polynomials. 




Figure 3 1 : Using k to smooth the derivative. 




1CE 



Figure 32: Three peaks for k = 5, one peak for k = 21 
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Figure 33: Twin towers detected (where only one exists.) 
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Figure 34: How to avoid the sharp corner artifact. 




Figure 36: Finding a left-bank shoulder by analyzing the second derivative. 




Figure 37: Finding a right-bank shoulder by analyzing the second derivative. 
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Figure 38: The complete analysis of the peak for shoulders. 




Figure 39: Example Peaks, Sizes, and a Matching. 

Peaks have red bases and green positions. 



Run-time vs Input Size 




Figure 40: A mesh of execution times. 




Figure 41 : Each curve holds constant the number of extra peaks m. 
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Figure 42: Each curve holds constant the number 

n of sizes in the size-standard definition. 



Three points, collinear in mobility space 
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Figure 43 : Three collinear points in mobility space. 
The blue line is a linear interpolation. 
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Figure 44: The three points from Figure 43 shown in size-versus-scan space. The 
blue line is a linear interpolation in mobility in mobility space and the red line is a 
linear interpolation in scan space. Note that the red line is not smooth at scan 15. 



40 



35 



Sizing Scan 17 



* Size Standard 

— Linear interpolation of mobility 

— Usual Southern 



CD 
N 

tn 



30 



o 25 



LL 



20 



15 



10 



10 




15 20 

Scan number 



25 



Figure 45: Three points that are collinear in size-versus-mobility space, and three 
that are collinear in size-versus-scan space. The blue line is a linear interpolation 
in mobility space and the green line is a linear interpolation in scan space. The 
two curves are averaged at scan 17. 
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